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Abstract 

A new five-parameter continuous distribution which generalizes the Kumaraswamy and 
the beta distributions as well as some other well-known distributions is proposed and studied. 
The model has as special cases new four- and three-parameter distributions on the standard 
unit interval. Moments, mean deviations, Renyi's entropy and the moments of order statistics 
are obtained for the new generalized Kumaraswamy distribution. The score function is given 
and estimation is performed by maximum likelihood. Hypothesis testing is also discussed. 
A data set is used to illustrate an application of the proposed distribution. 

Keywords: Beta distribution; Continuous proportions; Generalized Kumaraswamy distribu- 
tion; Kumaraswamy distribution; Maximum likelihood; McDonald Distribution; Moments. 



1 Introduction 

We introduce a new five-parameter distribution, so-called generalized Kumaraswamy (GKw) 
distribution, which contains some well-known distributions as special sub-models as, for example, 
the Kumaraswamy (Kw) and beta (OS) distributions. The GKw distribution allows us to define 
new three- and four-parameter generalizations of such distributions. The new model can be 
used in a variety of problems for modeling continuous proportions data due to its flexibility in 
accommodating different forms of density functions. 
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The GKw distribution comes from the fohowing idea. Wahed (2006) and Ferreira and Steel 
(2006) demonstrated that any parametric family of distributions can be incorporated into larger 
families through an application of the probability integral transform. Specifically, let Gi{-;uj) be 
a cumulative distribution function (cdf) with corresponding probability density function (pdf) 
gi{-;Lo), and 52(';t) be a pdf having support on the standard unit interval. Here, u and r 
represent scalar or vector parameters. Now let 

rGi(x;U:) 

F{x;u,t) = / g2it;T)dt. (1) 

^0 

Note that F{-;u,t) is a cdf and that F{-;lj,t) and Gi{x]uj) have the same support. The pdf 
corresponding to ([1]) is 

f{x;u,T) = g2{Gi{x;u)-T)gi{x;u). (2) 

This mechanism for defining generalized distributions from a parametric cdf Gi{-\u) is partic- 
ularly attractive when Gi{-\u) has a closed-form expression. 

The beta density is often used in place of g2{-;T). However, different choices for Gi{-;u}) 
have been considered in the literature. Eugene et al. (2002) defined the beta normal distribu- 
tion by taking Gi{-;u) to be the cdf of the standard normal distribution and derived some of 
its first moments. More general expressions for these moments were obtained by Gupta and 
Nadarajah (2004a). Nadarajah and Kotz (2004) introduced the beta Gumbel (BG) distribution 
by taking Gi{-;uj) to be the cdf of the Gumbel distribution and provided closed form expressions 
for the moments, the asymptotic distribution of the extreme order statistics and discussed the 
maximum likelihood estimation procedure. Nadarajah and Gupta (2004) introduced the beta 
Frechet (BF) distribution by taking Gi{-;uj) to be the Frechet distribution, derived the analyt- 
ical shapes of its density and hazard rate functions and calculated the asymptotic distribution 
of its extreme order statistics. Also, Nadarajah and Kotz (2006) dealt with the beta exponen- 
tial (BE) distribution and obtained its moment generating function, its first four cumulants, 
the asymptotic distribution of its extreme order statistics and discussed maximum likelihood 
estimation. 

The starting point of our proposal is the Kumaraswamy (Kw) distribution (Kumaraswamy, 
1980; see also Jones, 2009). It is very similar to the beta distribution but has a closed-form cdf 
given by 

Gi(x;cj) = 1 - (1 -x")'^, < X < 1, (3) 

where w = (a, /3)^, a > and /? > 0. Its pdf becomes 

gi{x;u) = apx^'-^il-x'^f'^, 0<x <1. (4) 

If X is a random variable with pdf (jj]), we write X ~ Kw(a,/3). The Kw distribution was 
originally conceived to model hydrological phenomena and has been used for this and also for 
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other purposes. See, for example, Sundar and Subbiah (1989), Fletcher and Ponnambalam 
(1996), Seifi et al. (2000), Ganji et al. (2006), Sanchez et al. (2007) and Courard-Hauri (2007). 

In the present paper, we propose a generalization of the Kw distribution by taking Gi{-]to) 
as cdf ([3]) and g2{'',T) as the standard generalized beta density of first kind (McDonald, 1984), 
with pdf given by 

52 x;t = , < X < 1, (5) 

5(7, r/) 

where r = (7,r/,A)^, 7 > 0,7] > and A > 0, B{'y,ri) = T{'y)T{ri)/T{'y + rj) is the beta 
function and r(-) is the gamma function. If X is a random variable with density function ([5]), 
we write X ~ GB1(7, t], A). Note that if X ~ GB1(7, r], 1) then X ~ <B(7, r/), i.e., X has a beta 
distribution with parameters 7 and rj. 

The article is organized as follows. In Section 2, we define the GKw distribution, plot its 
density function for selected parameter values and provide some of its mathematical properties. 
In Section 3, we present some special sub-models. In Section 4, we obtain expansions for the 
distribution and density functions. We demonstrate that the GKw density can be expressed as 
a mixture of Kw and power densities. In Section 5, we give general formulae for the moments 
and the moment generating function. Section 6 provides an expansion for the quantile function. 
Section 7 is devoted to mean deviations about the mean and the median and Bonferroni and 
Lorenz curves. In Section 8, we derive the density function of the order statistics and their 
moments. The Renyi entropy is calculated in Section 9. In Section 10, we discuss maximum 
likelihood estimation and determine the elements of the observed information matrix. Section 
11 provides an application to a real data set. Section 12 ends the paper with some conclusions. 



2 The New Distribution 

We obtain an appropriate generalization of the Kw distribution by taking Gi(-;a;) as the 
two-parameter Kw cdf Q and associated pdf (jl]). For g2{-;T), we consider a three-parameter 
generalized beta density of first kind given by ([5]). To avoid non-identifiability problems, we 
allow r] to vary on [1, 00) only. We then write 6 = 7]—! which varies on (0, 00). Using ([T|, the 
cdf of the GKw distribution, with five positive parameters a, (3, 7, 6 and A, is defined by 

where 6 = (a, /3, 7, 5, A)^ is the parameter vector. 

The pdf corresponding to ^ is straightforwardly obtained from ^ as 

^) = Bhf+l) ~ "'^^^'^''^ ^"^'^l^'^'^l - [1 - (1 - < X < 1. (7) 
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Based on the above construction, the new distribution can also be referred to as the McDonald 
Kumaraswamy (McKw) distribution. If X is a random variable with density function d?]), we 
write X ~ GKw(a, 7, 6, A). 

An alternative, but related, motivation for ^ comes through the beta construction (Eugene 
et al., 2002). We can easily show that 

F(x;6>) = /[i_(i_,.)^]a(7,<5 + 1), (8) 

where Ix{a, h) = B{a, h)~^ uj'^^^{l—u:)^^^du^ denotes the incomplete beta function ratio. Thus, 
the GKw distribution can arise by taking the beta construction applied to a new distribution, 
namely the exponentiated Kumaraswamy (EKw) distribution, to yield ([7]), which can also be 
called the beta exponentiated Kumaraswamy {BEKw) distribution, i.e., a beta type distribution 
defined by the baseline cumulative function G{x) = [1 — (1 — x")^]'*'. 

Immediately, inverting the transformation motivation ([8]), we can generate X following the 
GKw distribution by X = [1 - (1 - ^^i^^^ y 

is a beta random variable with 
parameters 7 and 5 + 1. This scheme is useful because of the existence of fast generators 
for beta random variables. Figure [T] plots some of the possible shapes of the density function 
([7|). The GKw density function can take various forms, bathtub, J, inverted J, monotonically 
increasing or decreasing and upside-down bathtub, depending on the parameter values. 

We now provide two properties of the GKw distribution. 

Proposition 1. If X ~ GKw(l, /3, 7, 5, A), then Y = X^/" ~ GKw(a, /3, 7, 6, A) for a > 0. 

Proposition 2. Let X ~ GKw(a, /3, 7, 5, A) and Y = — log(X). Then, the pdf of Y is given by 

/(y; ^) = - - (1 - e-'^yfV^-^l -[!-(!- e-^yffY, y > 0. (9) 

5(7, <5 + 1) 

We call ([9]) the log-generalized Kumaraswamy (LGKw) distribution. 

3 Special Sub-Models 

The GKw distribution is very flexible and has the following distributions as special sub- 
models. 

The Kumaraswamy distribution (Kw) 

If A = 7 = 1 and 5 = 0, the GKw distribution reduces to the Kw distribution with parameters 
a and /?, and cdf and pdf given by ([3]) and respectively. 
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(g) (h) (i) 

Figure 1: GKw density curves, (a) = (a, 3.5, 1-5, 2.5, 0.5)^ , (b) = (3.5, /3, 1.5, 2.5, 0.5)"^ , 
(c) e = (1.0, 1.5,7,2.5, 0.5)T, (d) 6 = (1.0, 1.5, 2.5, 0.5)^ , (c) 6 = (0.5, 0.7, 0.1, 3.0, A)"^ , (f) 
e = (a, /3, 2.5, 0.1, 0.5)^, (g) 6 = (a, 1.5, 2.5,5, 0.5)^, (li) = (0.5,0.7,0.15,5, A)^, (i) 9 = 
(0.5, 1.0, 7, 0.3, A)T. 
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The McDonald distribution (Mc) 

For a = P = 1, we obtain the Mc distribution 1^ with parameters 7, 5 + 1 and A. 

The beta distribution 

Ifa = /3 = A = 1, the GKw distribution reduces to the beta distribution with parameters 7 
and 6 + 1. 

The beta Kumaraswamy distribution (BKw) 

If A = 1, dZI) yields 

fix; a, (3, 7, 5, 1) = f'l il - x^f^'+^)-^ [1 - (1 - x^fV-\ < x < 1. 
5(7,5 + 1) 

This distribution can be viewed as a four-parameter generahzation of the Kw distribution. We 
refer to it as the BKw distribution since its pdf can be obtained from ^ by setting Gi{x] u) to 
be the Kw(a, /3) cdf and g2{x] r) to the *B(7, (5 + 1) density function. 

The Kumaraswamy-Kumaraswamy distribution {KwKw) 

For 7 = 1, dZI reduces to (for < a; < 1) 

fix; a, /3, 7, ,5, A) = Aa/3(5 + l)x'''\l - x^'f-^il - (1 - x''f]^-^{l -[!-(!- x")'^]^}^ 

Again, this distribution is a four-parameter generahzation of the Kw distribution. It can be 
obtained from ([2]) by replacing Giix;uj) by the cdf of the Kw(a,/3) distribution and g2ix;T) by 
the pdf of the Kw(7, 5 + 1) distribution. Its cdf has a closed form given by 

Fix; a, /3, 1, J, A) = 1 - {1 - [1 - (1 - x''f]^Y+' . 

The EKw distribution 

If (5 = and 7 = 1, d?]) gives 

/(x;a,/3,l,0,A) = Aa/3x"-^(l - - (1 - < x < 1. 

It can be easily seen that the associated cdf can be written as 

F(x;a,/3,1,0,A) =Gi(x;a,/3)\ 

where Gi(x;a,/3) is the cdf of the Kw(a,/3) distribution. This distribution was defined before 
as the EKw distribution which is a new three-parameter generalization of the Kw distribution. 
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The beta power distribution {BP) 

For a = 1 and /3 = 1, Q reduces to 

/(x; 1,1, 7, 5, A) = ^ _ ^7A-i(i_^A)^^ 0<x<l. 

5(7, J + 1) 

This density function can be obtained from ^ if Gi{x) = and g2{x) is taken as the beta 
density with parameters 7 and 5 + 1. We cah this distribution as the BP distribution. 

The LGKw distribution (0) contains as special sub- models the following distributions. 

The beta generahzed exponential distribution (BGE) 

For A = 1, ([9|) gives 

f{y; a, 1, 7, 6, A) = „, "f^i, e-°^(l " e-'^yf^'+'^-' [1 - (1 - e-^)'^]^-\ y > 0, (10) 

which is the density function of the BGE distribution introduced by Barreto-Souza et al. (2010). 
If 7 = 1 and (5 = in addition to A = 1, the LGKw distribution becomes the generalized 
exponential distribution (Gupta and Kundu, 1999). IfA = /3 = 7 = l and 5 = 0, (fTOj) coincides 
with the exponential distribution with mean 1/a. 

The beta exponential distribution (BE) 

For (3 = 1 and A = 1, Q reduces to 

f{y;a,l,j,5,l)= e-"^^(l - e-"^)^ y > 0, 

S(7,d + 1) 

which is the density of the BE distribution introduced by Nadarajah and Kotz (2006). 

4 Expansions for the Distribution and Density Functions 

We now give simple expansions for the cdf of the GKw distribution. If l-z] < 1 and 5 > is 
a non-integer real number, we have 

00 

(i-z)' = ^{-iy{5),z^, (11) 

j=0 
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where {6)j = 5{6 — l)...{6 — j + l) (for j = 0, 1, . . . ) is the descending factorial. Clearly, if 5 is a 
positive integer, the series stops at j = 5. Using the series expansion (jlip and the representation 
for the GKw cdf Q , we obtain 

if (5 is a non-integer real number. By simple integration, we have 

oo 

F{x;9) = Y,^AGi{x;a,/3)]^^''+^\ (12) 

j=0 

where 

and Gi(x; a, /3) is given by ([3]). If 5 is a positive integer, the sum stops at j = 6. 

The moments of the GKw distribution do not have closed form. In order to obtain expansions 
for these moments, it is convenient to develop expansions for its density function. From (I12p . 
we can write 

oo 

fix; 9) = J2 ^jKl + j)9i {x; «, /3) [Gi (x; a, /3)]^(^+^)-i . 

j=0 

If we replace Gi{x] a, (3) by ([3]) and use (jl]), we obtain 

oo 

f{x;e) = Y,Pk9i{x\a,{k + l)P), (14) 

fc=0 

where pfc = Sjlo^i '^ith tj^^ = ((/))fc A(7 + j)(-l)''/(A; + 1). Here, </) = (7 + j)A - 1 and 
gi{x; a, (/c + 1)^5) denotes the Kw(a, {k + l)/3) density function with parameters a and (A; + l)/3. 
Further, we can express (I14p as a mixture of power densities, since the Kw density (jl]) can also 
be written as a mixture of power densities. After some algebra, we obtain 

00 

f{x;e)=Y,V^X^'+^^^-\ (15) 

where 



{-lYaPY.{k + l){{k + l)l3-l)iPk. 



Equations (jl4p and (|15p are the main results of this section. They can provide some mathemat- 
ical properties of the GKw distribution from the properties of the Kw and power distributions, 
respectively. 
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5 Moments and Moment Generating Function 

Let X be a random variable having the GKw distribution d?]). First, we obtain an infinite 
sum representation for the rth ordinary moment of X, say //^ = E{X^). From p4p. we can write 

oo 

f^'r = '^PkTr{k), (16) 

fc=0 

where Tr{k) = x^gi{x; a, (k + l)/3)dx is the rth moment of the Kw(a, {k + l)/3) distribution 
which exists for all r > —a. Using a result due to Jones (2009, Section 3), we have 

Tr{k) = {k + l)pB(l + ^,{k + l)/3y (17) 



Hence, the moments of the GKw distribution follow directly from (I16p and (I17p . The central 
moments (fig) and cumulants (k^) of X are easily obtained from the ordinary moments by 
l^s = ELo {k)(-'^)''^^l^^'s-k and ki = fi[, K2 = ^2 - , ^3 = ^3 - 3/i2//'i + 2fif, K4 = - 
4/i^j/i;-3^f + 12^'2/^f-6/i'/, K5 = /i'5-5/i>'i-10^'3/^'2 + 20^^/if +30/i'2V'i-60^'2^f +2V/,K6 = 

etc., respectively. The rth descending factorial moment of X is 

r 

^'(,) = E[XM] = E[X{X - 1) X . . . X (X - r + 1)] = ^ s{r, m) fi'^, 

m=0 

where s{r,m) is the Stirling number of the first kind given by s{r,m) = (m\)~^d"^x^^^ /dx"^\x=o- 
It counts the number of ways to permute a list of r items into m cycles. Thus, the factorial 
moments of X are given by 

00 r 

/^(r) = J2p''Y1 s{r,m)T„i{k). 
The moment generating function of the GKw distribution, say M{t), is obtained from (jl5p 

as 



00 „i 
M{t) = V f» / exp 

•''0 



{tx)dx. 



1=0 

By changing variable, we have 



M(t) = V / n(*+i)°-i exp(-n)dn 

-'0 



i=0 

and then M{t) reduces to the linear combination 

7((i + l)a,t) 



M{t) = ^ 



00 

Vi 



i=0 

where 7(0, x) = Jq u'^~^ exjp{—u)du denotes the incomplete gamma function. 
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6 Quantile Function 

We can write dH) as F{x] 9) = Iz{'^, 5 + 1) = u, where z = [1 — {1 — x°')^]'^. From Wolfram's 
websit^ we can obtain some expansions for the inverse of the incomplete beta function, say 
z = Qb{u), one of which is 

z = Qb{u) = aiv + a2v'^ + 031;^ + a^v^ + 0{v^/'^), 

where v = [^uB{'^^5 + 1)]^/'^ for 7 > and Oq = 0, ai = 1, 02 = (5/(7 + 1), 

(^[72 + 3(5 + 1)7 - 7 + 5<5 + 1] 



as 



2(7 + 1)2(7 + 2) 



04 = 5{7^ + (6(5 + 5)7^ + (5 + 3) (8J + 3)7^ + [33(5 + 1)^ - 30J + 26]7 
+ {5 + 1)(315 - 16) + 18}/[3(7 + 1)^(7 + 2)(7 + 3)], . . . 

The coefficients a^s for i > 2 can be derived from the cubic recursion (Steinbrecher and Shaw, 
2007) 

i-l 



1 ( 

= [,2 + _ 2)i + (1 - 7)] I - P^''^ E a^+l-r [r(l - - v) 

1 i—r 

- r{r - 1)] + ^ ^ Or as ai+i-r-s [r{r - 7) + 5(7 + ^ - 2) 

r=l s=l 



X (i + l-r-s)]|, 

where pi^2 = 1 if i = 2 and pi^2 = if i 7^ 2. In the last equation, we note that the quadratic 
term only contributes for i > 3. Hence, the quantile function Qgkw{u) of the GKw distribution 
can be written as Qgkw{u) = {1 - [1 - Qb('u)^/^]^/^}^/". 

7 Mean Deviations 

liX has the GKw distribution, we can derive the mean deviations about the mean p'^ = E{X) 
and about the median M from 



61= \x — p'i\ f{x;0)dx and 62 = 
Jo Jo 



x-M\ f{x;e)dx, 



^http://functions.wolfram.com/06. 23.06.0004.01 
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respectively. From ([8]) , the median M is the solution of the nonlinear equation 

-^[i-(i-M">'3]^(7,'^ + 1) = 1/2- 
These measures can be calculated using the relationships 

5i = 2[^;F(^;; 6) - J(/i'i; 6>)] and 82 = fi[ - 2J(M; 6). 
Here, the integral J{a;0) = xf{x;9)dx is easily calculated from the density expansion ([15 



as 



°° {i+l)a+l 



i=Q 

We can use this result to obtain the Bonferroni and Lorenz curves. These curves have 
applications not only in economics to study income and poverty, but also in other fields, such 
as reliability, demography, insurance and medicine. They are defined by 

Bip-e)='I^ and Lip;e) = ^, 
respectively, where q = F^^{p;9). 

8 Moments of Order Statistics 

The density function of the ith order statistic say fi;n{x; 0), in a random sample of size 
n from the GKw distribution, is given by (for i = 1, • • • , n) 

U.nix; 9) = TjT-^-— rT/(^; d)F{x; ey-^{l - F{x- 9)r-\ < x < 1. (18) 

The binomial expansion yields 

1 / — 1 \ 

B{i,n-i + l) J J 

and using and integrating (jl5p we arrive at 

n— 1 / ,\ / 00 \ 

B{i, n 



,t=0 / j=0 ^ •' / \s=0 



where v* = Vs[{s + l)a] ^. 
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We use the following expansion for a power series raised to a integer power (Gradshteyn and 
Ryzhik, 2000, Section 0.314) 

OO \ p oo 

Y^ajxA =X]ci,pa;^ (19) 
j=o ^ j=0 

where -p is any positive integer number, co,p = afj and c^^p = (sao)^"*^ Sj=i ^^"P ~ + j)^j'^s--j,p for 
all s > 1.] We can write 

n-l 

{s+t+i+j)a-l 



^ ^ i?(i,n-i + i) V i ^ 



s,i+j — 1 2^ 

j=0 \ / s,t=0 



where eo,i+j-i = Vq'"*^"' and (for s > 1) 



es,i+j_i = (svq) ^ ^ [m(i + j - 1) - s + m]v'^es-m,i+j-i- 

m=l 



The ri/i moment of the ith order statistic becomes 

^(^-) - g ( ; ) E^v^m^a 

We now obtain another closed form expression for the moments of the GKw order statistics 
using a general result due to Barakat and Abdelkader (2004) applied to the independent and 
identically distributed case. For a distribution with pdf f{x; 0) and cdf F{x] 0), we can write 

m=n— i+l 

where 



lm{r)= C x'-^{l-F{x-e)rdx. 
Jo 



For a positive integer m, we have 

»1 m. 



Im{r) = x^-~'Y.{ 7 Y-mF{x;e)fdx. 



p=0 

By replacing (|12p in the above equation we have 

m „i / oo \P 

Im{r) = Y.{-iy( ) / x'-'{ Y.u;,[G,{x;a,P)]'^'r+j)\ ^x. (21) 
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Equations and (HI]) yield 



m „i oo 

p=0 ^ j=0 
By replacing Gi{x;a,(3) by ([3]) and using (fTTj) we obtain 

m cxD 

^™(^) = E ( 7 ) ^ '^'^ ^^^^ / "^'"'^^ " ^'^^^^'^^^ 

p=0 j,w=0 ■'^ 

where ■0 = -^(7 + j)- Since B{a/b,c) = b fQW°'~^{l — w^Y~^dw for a,6,c > (Gupta and 
Nadarajah, 2004b), we have 



Im{r) = '^ ^ Spj>S(-,/?tU + 1), 
p=0 j,w=0 



where 



(-l)P+'"m! 
a(rn- — p)\p\ 



Finally, E{Xl.^) reduces to 



n 



E(xij=r ^ (-ir— -'(::; )(::)EEv.»5&'S'"+i) - (22) 

m=n— j+l I. p=0 j,«)=0 J 

Equations (|20|) and (|22p are the main results of this section. The L-moments are analogous 
to the ordinary moments but can be estimated by linear combinations of order statistics. They 
are linear functions of expected order statistics defined by (Hoskings, 1990) 



Xr+i = {r + l)-'j2{-lf(l)E{X^ 

k=0 ^ ^ 



-l-fc:r+l), r = 0, 1, . . . 



The first four L-moments are Ai = E{Xi-i), A2 = ^E{X2:2 — -^^1:2)5 A3 = ii?(X3:3 — 2X2:3 + ^1:3) 
and A4 = ^E{X4-4 — 8X3:4 + 8X2:4 — Xi:4). These moments have several advantages over the 
ordinary moments. For example, they exist whenever the mean of the distribution exists, even 
though some higher moments may not exist, and are relatively robust to the effects of outliers. 
From (I22p applied for the means (r = 1), we can obtain expansions for the L-moments of the 
GKw distribution. 
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9 Renyi Entropy 



The entropy of a random variable X with density function f[x) is a measure of variation of 
the uncertainty. One of the popular entropy measures is the Renyi entropy given by 

1 



Jr{p) 



l-p 



log 



fP{x)dx 



, p>0, p^l. 



From (|15p . we have 



fix;er={j2 



ViX 



In order to obtain an expansion for the above power series for /? > 0, we can write 

j 



e(;)(-)4-(e 

j=o ^■'^ L \i=o 



ViX 



oo J 



j=0 r=Q 



J J V 



■ E 



ViX 



,i=Q 



Using equation (|T9|) . we obtain 



oo J 



where do,r- = ds,r = (•sfo)~^ X]m=i("^'^ — s + m)vr)ids-m,T for all s > 1. Hence, 



Jr{p) 



1 



log 



oo ] 

EE 



(i + r)a — r + 1 



(23) 



■ i,j=0 r=0 

10 Maximum Likelihood Estimation 

Let Xi,X2, ■ ■ ■ ,Xn be a random sample from the GKw(A, a, /3, 7, 5) distribution. From ([7]) 
the log-likelihood function is easy to derive. It is given by 



i=l 



i{e) = nlog(A)+nlog(a) + nlog(/3)-nlog[B(7,5 + l)] + (a-l)^log(xi) 
n n 

+(/3 - 1) log(l - x2) + (7A - 1) J] log[l - (1 - xlf] + 

i=l 1=1 

n 

jj]iog[i-{i-(i-xnY]- 
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By taking the partial derivatives of the log-likelihood function with respect to A, a, /3, 7 and S, 
we obtain the components of the score vector, U{0) = {Uq,, C/g, Uj, Us, U\). They are given by 

n n ■ n 

^ + - - l)zi] \og{xi) + (7A - 1) ^ ^ - (^A ^ 

4=1 i=l i=l 

71 Tl • Tl 

- + log(l - xt) + (7A - 1) E ^ - A<55^ 

^ i=i i=i i=i 

n 

-n[V'(7) - ^-(7 + 5 + 1)] + A log(yi), 

4=1 

-n[V;(,5 + 1) - V;(7 + ^ + 1)] + log(l - y^), 

1=1 

n 
i=l 

where -;/'(•) is the digamma function, = 1 — (1 — )^, Vi = y^~^{l — y^^)^^, Zi = xf{l — xf)~^, 
Viia) = dyi/da = -/3xf{l - xf)^-^ log{xi) and yj(^) = dyi/d/3 = -(1 - xf)^log(l - xf).For 
interval estimation and hypothesis tests on the model parameters, the observed information 
matrix is required. The observed information matrix J = J{6) is given in the Appendix. 

Under conditions that are fulfilled for parameters in the interior of the parameter space, 
the approximate distribution of \fn{Q — 6) is multivariate normal A^5(0, where 6 is the 
max;imum likelihood estimator (MLE) of 6 and I{6) is the expected information matrix. This 
approximation is also valid if I (6) is replaced by J (9). 

The multivariate normal Nr^{0, J^^(6)) distribution can be used to construct approximate 
confidence regions. The well-known likelihood ratio {LR) statistic can be used for testing hy- 
potheses on the model parameters in the usual way. In particular, this statistic is useful to check 
if the fit using the GKw distribution is statistically superior to a fit using the BKw, EKw and 
Kw distributions for a given data set. For example, the test of Hq : A = 1 versus Hi : A 7^ 1 is 
equivalent to compare the BKw distribution with the GKw distribution and the LR statistic re- 
duces to u; = 2[i{a, /3, 7, 5, A) — £{a, /3, 7, 5, 1)], where 6 and 6 are the unrestricted and restricted 
MLEs of 6, respectively. Under the null hypothesis, w is asymptotically distributed as Xi- For 
a given level the LR test rejects Hq if w exceeds the (1 — C)-quantile of the Xi distribution. 

11 Application 

This section contains an application of the GKw distribution to real data. The data are 
the observed percentage of children living in households with per capita income less than R$ 
75.50 in 1991 in 5509 Brazilian municipal districts. The data were extracted from the Atlas of 



UaiO) = 

um = 

u,{e) = 

Usie) = 
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Brazil Human Development database available at http://www.pnud.org.br/ The histogram of 
the data is shown in Figure [2] along with the estimated densities of the GKw distribution and 
some special sub-models. Apparently, the GKw distribution gives the best fit. 

The GKw model includes some sub-models described in Section [3] as especial cases and 
thus allows their evaluation relative to each other and to a more general model. As mentioned 
before, we can compute the maximum values of the unrestricted and restricted log-likelihoods 
to obtain the LR statistics for testing some sub-models of the GKw distribution. We test 
Hq : (a,/3,A) = (1,1,1) versus Hi : Hq is not true, i.e. we compare the GKw model with the 
beta model. In this case, w = 2{1510.7 - 1271.6)} = 239.1 (p-value< 0.001) indicates that the 
GKw model gives a better representation of the data than the beta distribution. Further, the 
LR statistic for testing Hq : A = 1 versus Hi : A 7^ 1, i.e. to compare the GKw model with 
the BKw model, is u; = 2(1510.7 - 1383.6) = 254.2 (p-value< 0.001). It also yields favorable 
indication for the GKw model. Table [T] lists the MLEs of the model parameters (standard errors 
in parentheses) for different models. The computations were carried out using the subroutine 
MAXBFGS implemented in the Ox matrix programming language (Doornik, 2007). 



Table 1: MLEs of the model parameters. 



Distribution 


a 




/3 


7 


<5 


A 


m 


GKw 


18.1161 


1 


8132 


0.7303 


0.0609 


15.7803 


1510.6670 




(0.1829) 


(0 


0219) 


(0.0057) 


(0.0008) 


(0.8908) 




BKw 


0.0247 





1849 


26.0933 


17.3768 




1383.5690 




(0.0003) 


(0 


0005) 


(0.1054) 


(0.0739) 






KKw 


2.7191 





4654 




0.0968 


79.9999 


1405.0650 




(0.0086) 


(0 


0060) 




(0.0005) 


(1.0915) 




PKw 


17.9676 
(0.2421) 



(0 


1647 
0019) 






1.1533 
(0.0054) 


1237.5800 


BP 








0.1590 
(0.0018) 


16.7313 
(0.1998) 


0.2941 
(0.0129) 


1269.9760 


Kw 


2.4877 
(0.0295) 


1 

(0 


3369 
0180) 








1278.7860 


Beta 








2.5678 
(0.0317) 


0.3010 
(0.0147) 




1271.5610 



12 Conclusions 

We introduce a new five-parameter continuous distribution on the standard unit interval 
which generalizes the beta, Kumaraswamy (Kumaraswamy, 1980) and McDonald (McDonald, 
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FigTirc 2: Histogram and estimated pdf 's for the percentage of children hving in households with 
per capita income less than R$ 75.50 (1991) in 5509 Brazilian municipal districts. 
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1984) distributions and includes as special sub-models other distributions discussed in the hter- 
ature. We refer to the new model as the generalized Kumaraswamy distribution and study some 
of its mathematical properties. We demonstrate that the generalized Kumaraswamy density 
function can be expressed as a mixture of Kumaraswamy and power densities. We provide the 
moments and a closed form expression for the moment generating function. Explicit expressions 
are derived for the mean deviations, Bonferroni and Lorenz curves and Renyi's entropy. The 
density of the order statistics can also be expressed in terms of an infinite mixture of power 
densities. We obtain two explicit expressions for their moments. Parameter estimation is ap- 
proached by maximum likelihood. The usefulness of the new distribution is illustrated in an 
analysis of real data. We hope that the proposed extended model may attract wider applications 
in the analysis of proportions data. 
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Appendix 

The elements of the observed information matrix J {6) for (a, /3, 7, 5, A) are 
Jaa = - ^ - (/3 - 1) ^ ij(a) log(a;j ) + 

" i=l i=l ^ 

n 



1=1 



i=l i=l < Vi ) i=i 

n ^ n n 



i=l i=l i=l 



18 



n • n n • n 

= A2^-— , Jps = ->^2^Viyi^f^-j, JfiX = l2^-—-5}_^Viyi^ii), 

1=1 1=1 1=1 1=1 

J^^ = -n{ip\-i)-il}'{-i + 5 + l)}, = raV''(7 + (^ + 1), -A-A = XI ^^S^^^^)' 

i=l 

n 

Jss = -n{tp'{5 + l)-^'{rf + d + l)}, Jsx = -^ymlogiyi), 

1=1 

n " 

1=1 

where i^f,,) = azj/Sa = (1 + z^^i logC^O' Viia) = d'^yi/da^ = {1 - (/? - log(xj), 
ym = d'^yi/dP'^ = m^f}) log(l - x'^), in^^p) = d^yi/daOP = {1//3 + log(l + xf)}yi(^a) , ^'i(a) = 
dvi/da = {(A - + Xvi}viyi(^a), ^'j(/3) = ^^'j/^/? = {(A - l)/yi + )^Vi}viyii^p), Vi^x) = dvi/dX = 
(1 + yiVi)vi log(yi) and (•) is the first derivative of the digamma function. 
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